Introduction

This period I would like to investigate how song lyrics correlate with music properties such as modality, energy levels and dynamic pitch range across different genres. More specifically, the research focuses on relating the words contained by the lyrics to musical properties. I find it highly interesting to see how the mood and emotionality of a song/genre affect a sing and songwriter when writing the lyrics. I expect some obvious results, such that hip hop is generally more about the ‘hood’ than rock music, and perhaps that songs in a minor mode deal with sad topics more frequently than major mode songs.

Because the lyrical vocabulaire is extremely rich, a large, diverse dataset is of the essence. To accomplish this and to keep the corpus representative, I will put together a corpus that draws inspiration from a broad range of genres. This includes mainstream genres such as pop music, but also more obscure ones such as industrial hip hop, as unexpected yet interesting patterns may emerge. I will dissect the word usage for each genre and then compare word usages of different genres. It would be interesting to know if genres with similar lyrics have similar properties.

A list of albums per genre that make up the corpus (for now):

Pop: Midnights (Taylor Swift); WHEN WE ALL FALL ASLEEP, WHERE DO WE GO? (Billie Eilish); Dua Lipa (Dua Lipa)

Hip Hop: Madvillainy (MF DOOM, Madlib); ASTROWORLD (Travis Scott); HEROES & VILLAINS (Metro Boomin)

Alternative Rock: Elephant (The White Stripes); ..Like Clockwork (Queens of the Stone Age); Street Worms (Viagra Boys)

Industrial hip hop: The Money Store (Death Grips); OFFLINE! (JPEGMAFIA); Visions of Bodies Being Burned (clipping.)

Classical (translated to english): Wilhelmus (Marnix Van St. Aldegonde), Negende symfonie (Beethoven)

Comparison of tracks within an album


Before we shall discover inter-album relations, let us commit to a single album, such that we can shape an idea what the variation within an album might look like.

Comparison of different albums


Intuitively, tempo and energy are correlating factors of a song. We imagine high energy songs generally have a fast tempo. Let us investigate this thought by plotting the data for number of albums. As you can see, some albums tend to be limited in their energy range, whilst others are more or less contained.

Conclusion

What is there not to say. We live in a data driven society that harbors as many music tastes as there are colors in a van Gogh painting. But just like a van Gogh painting, you can dissect it and scrutinize the most elementary aspects, from its radiance to its perspective on the cruelties and absurdities of society. We looked at energy levels, at tempo, a variety of tracks and albums, from a point of view of strict objectiveness, one that our primordial ancestors would not even be able to fathom. One could draw an infinite number of conclusions, some might jump out more than others. What is a data driven society without conclusions?